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Abstract 

Large deviation principles are established for the two-parameter Poisson-Dirichlet distribu- 
tion and two-parameter Dirichlet process when parameter approaches infinity. The motivation 
for these results is to understand the differences in terms of large deviations between the two- 
parameter models and their one-parameter counterparts. New insight is obtained about the 
role of the second parameter a through a comparison with the corresponding results for the 
one-parameter Poisson-Dirichlet distribution and Dirichlet process. 
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1 Introduction 

For > and a in (0, 1), let C/^, A; = 1, 2, be a sequence of independent random variables such 
that Uk has Beta{l — a,0 + ka) distribution. Set 

Xf'" = Ui, X^'" = (1 - ?7i) • • • (1 - Un-i)Un, n>2. (1.1) 



Then with probability one 



and the law of {X^' , Xg' , ...) is called the two-parameter GEM distribution denoted by GEM{9, a) 



'Research supported by the Natural Science and Engineering Research Council of Canada 



1 



Let P{6,a) = {Pi{9, a), P2{0,a), ...) denote {X^'"^ , , ■■■) in descending order. The law of 
P{a,6) is called the two-parameter Poisson-Dirichlet distribution and is denoted by PD{9,a). 

Let (,k,k = 1, ... be a sequence of i.i.d. random variables with common diffusive distribution v 
on [0, 1], i.e., z^(x) = for every x in [0, 1]. Set 

00 

k=l 

We call the law of 'Ee^a,^ the two-parameter Dirichlet process, denoted by Dirichlet{6,a,v). 

If a = in (II. ip . we get the well known GEM distribution, the Poisson-Dirichlet distribution, 
and Dirichlet process denoted respectively by GEM{6), PD{6), and Dirichlet{6,u). 

There is a vast literature on GEM distribution, the Poisson-Dirichlet distribution, and Dirichlet 
process. The areas where they appear include Baysian statistics (|9]), combinatorics ([20]), ecology 
([in])) population genetics ([?]), and random number theory ([21j). 

The Poisson-Dirichlet distribution was introduced by Kingman [12j to describe the distribution 
of gene frequencies in a large neutral population at a particular locus. In the population genetics 
setting it is intimately related to the Ewens sampling formula that describes the distribution of the 
allelic partition of a sample of size n genes selected from the population. The component Pk{0) 
represents the proportion of the A;th most frequent alleles. If u is the individual mutation rate and 
A'^e is the effective population size, then the parameter 9 = AN^u is the scaled population mutation 
rate. 

The GEM distribution can be obtained from the Poisson-Dirichlet distribution through a pro- 
cedure called size-biased sampling. Here is a brief explanation. Consider a population consisting of 
individuals of countable number of different types labelled {1,2, ...}. Assume that the proportion 
of type i individual in the population is pi. A sample is randomly selected from the population and 
the type of the selected individual is denoted by cr(l). Next remove all individuals of type cr(l) from 
the population and then randomly select the second sample. This is repeated to get more samples. 
Denote the type of the ith selected sample by cr{i). Then {Pa(i),Pa{2)T--) is called a size-biased 
permutation of {pi,p2, ■■■)■ Then sequence X^,k = 1,2,... defined in (jl.lh with a = has the 
same distribution as the size-biased permutation of P(^) = P(^,0). The name GEM distribution 
is termed by Ewens after R.C. Grifffiths, S. Engen and J.W. McCloskey for their contributions to 
the development of the structure. The Dirichlet process first appeared in [9]. 

The literature on the study of Poisson-Dirichlet distribution and Dirichlet process with two 
parameters is relatively small but is growing rapidly. Carlton [2j includes detailed calculations 
of moments and parameter estimations of the two-parameter Poisson-Dirichlet distribution. The 
most comprehensive study of the two-parameter Poisson-Dirichlet distribution is carried out in 
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Pitman and Yor [T8]. In [6] and the references therein one can find connections between two- 
parameter Poisson-Dirichlet distribution and models in physics including mean- field spin glasses, 
random map models, fragmentation, and returns of a random walk to origin. The two-parameter 
Poisson-Dirichlet distribution also found its applications in macroeconomics and finance ([l]). 

The Poisson-Dirichlet distribution and its two-parameter counterpart have many similar struc- 
tures including the urn construction in [TT] and [8], GEM representation, sampling formula (|16j). 
etc.. A special feature of the two-parameter Poisson-Dirichlet distribution is included in Pitman 
|15j where it is shown that the two-parameter Poisson-Dirichlet distribution is the most general 
distribution whose size-biased permutation has the same distribution as the GEM representation 

The objective of this paper is to establish large deviation principles (henceforth LDP) for 
GEAI{6,a), PD(9,a), and Dirichlet{6,a,u) when 9 approaches infinity. Noting that for the 
one-parameter model, 9 is the scaled population mutation rate. For fixed individual mutation rate 
u, large 9 corresponds to large population size. In the two parameter setting, we no longer have 
the same explanation. But it can be seen from (jl.ip that for nonzero a, large 9 plays a very similar 
role mathematically as in the case a = 0. 

LDP for Dirichlet(9, v) has been established in |13j and [S] using different methods. Recently 
in the LDP is established for PD{9). From (jl.ip . one can see that for every fixed the 
impact of a diminishes as 9 becomes large. It is thus reasonable to expect similar LDPs between 
GEM [9, v) and GEAI{9, a, v). But in PD(9, a) and Dirichlet{9 , a, u), every term in (jl.ip counts. 
It is thus reasonable to expect that the LDP for PD{9) and Dirichlet{9,v) are different from the 
corresponding LDPs for PD{9,a) and Dirichlet{9,a,v). But it turns out that the impact of a 
only appears in the LDP for Dirichlet{9 , a, v). 

LDP for GEM{9, a) is given in Section 2. Using Perman's formula and an inductive structure, 
we establish the LDP for PD{9, a) in Section 3. The LDP for Dirichlet{9 , a, v) is established in 
Section 4 using the subordinator representation in [18] and a combination of the methods in [13j 
and Further comments are included in Section 5. 

The reference [5] includes all the terminologies and standard techniques on large deviations 
used in this article. Since the state spaces encountered here are all compact, there is no need to 
distinguish between a rate function and a good rate function. 
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2 LDP for GEM 

Let E = [0, 1], and E°° be the infinite Cartesian product of E. Set 

oo 

£ = {{xi,X2,...)eE'^ :Y.Xk< 1}, 

k=l 

and consider the map 

G: E°° ^ £,{ui,U2,...) (xi,X2,..) 

with 

Xi = Ui,Xn = Un{l - Ui) • ■ ■ (1 - li„_l), n>2. 

By a proof similar to that used in Lemma 3.1 in [3], one obtains the following lemma. 

Lemma 2.1 For each k > 1, the family of the laws of Uk satisfies a LDP on E with speed 9 and 
rate function 

f log-r^, u e [0,1) 
Ii(n) == <^ ^1-"' ^ ' ^ (2.3) 

I oo, u = 1 

Theorem 2.2 The family {GEM{9,a) : 6 > 0,0 < a < 1} satisfies a LDP on 8 with speed 6 and 
rate function 

or , j l°gT3^' E£iX.<l 

S{XI,X2,...) == < 

I cxD, else. 

Proof: Since Ui,U2,.. are independent, for every fixed n the law of (t/i, C/„) satisfies a LDP 
with speed 9 and rate function ^27=1 -^i(^i)- -^o^ ^j'^ E°° , set 



|u-v| =^ 



oo I I 
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i=l 

Then for any 5" > and u in E'^, one can choose n > 1 and small enough < 6' < 5 < 6" 
such that 

{v (£E°° : max \vi - Ui\ < 6'} C {v € -E"" : |v - u| < S}, 

l<i<n 

{v G : |v - u| < 5} C {v G : max \vi - Ui\ < 6"}, 

l<i<n 



which implies 



1 " 
hm hminf - log P{|(C/i, [/2, ...) - u| < 5} > - 5] L,{ui), (2.4) 

5^0 y— »oo u — : 

4 = 1 
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limlimsup-logP{|([/i,[/2,...) -u| < 5} < -J^hM. (2.5) 

5^0 e^oo (7 ^ 



Since E°° is compact, by letting n approach infinity in (j2.4p and (j2.5p . it follows that the law 
of (Ui, U2, •••) satisfies a LDP with speed 9 and rate function J2i^i -^i(^i)- The effective domain of 

C = {u G E°° : Ui < l,^Ui < 00} 

i=l 

and 

Since the map G is continuous, it follows from contraction principle and Lemma 12.11 that the 
family {GEM{6,a) : > 0, < a < 1} satisfies a LDP on £ with speed 6 and rate function 

00 

inf{^/i(ni) : ui = Xi,n2(l - ui) = X2, ■■■}■ (2.7) 

1=1 

For each 1 < n < +00, 

n 

{l-Ul)■■■{l-Un) = l-Y.X^. (2.8) 

1=1 

Hence if X^iLi Xi = 1 for some finite n, then one of ui, is one, and u is not in C. If J27=i < 1 
for all finite n and J2iZi = 1, then from (j2.8p 



lim ^00(1 — Ui) = —00, (2.9) 

J, — >nc! ^ — ^ 



n—^oo 

i=l 



which implies that J2^i '^i = 00. Thus the inverse of the set {{xi, ...) E £ : X^i^i = 1} under G 
is disjoint with C. Hence the rate function in (12. 7p is the same as 5(xi,X2, ...). 



□ 



3 LDP for Two-Parameter Poisson-Dirichlet Distribution 

In this section, we establish the LDP for PD{9,a). Recall that P{9,a) = {Pi{9,a),P2{9,a)...,) is 
the random probability measure with law PD{9,a). When a = 0, we will write 

P{9) = P{9,0) = {Pi{9),P2i9)...,). 

3.1 Perman's Formula 

For any constant C > 0, /3 > 0, let 

h{x) = aCx-("+^\a; > 0, 

and 



^ r(/3 + l)(CT(l-a))^/'^ 
''"•^ TiP/a + 1) 
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Let iplt) be a density function over (0, oo) such that for all /? > —a 



t-^'il){t)dt = —. (3.10) 

Ca^p 



Set 



ipi{t,p) = h{tp)tilj{tp),t > 0,0 <p < l,p= 1 - p (3.11) 
, , i h{tp)tC Mtp,q)dq, p < l/(n + 1) 

i'n+l[t,p) = < (3.12) 

I 0, else. 
Then the following result is found in [ri](see also [18j). 

Lemma 3.1 (Perman's Formula) For each k>l, let f{pi, ■■■,Pk) denote the joint density function 
of{Pi{a,6),...,Pk{a,e)). Then 



oo 



f{pi,...,Pk) = Ca,e t gkit,pi, ...,pk)dt, (3.13) 
Jo 

where for k > 2,t > 0, < pk < ■ ■ ■ < Pi,Y.i=iPi < 1; and pt = I - pi -Pk-i, 

I. ^ t^-^h{tpi)---h{tpk-i) Pk. 

gk{t,pi, ■.■,Pk) = 9i{tPk, — ) (3.14) 

Pk Pk 

and 

oo 

5i(t,p) = E(-ir^Vn(t,p). (3.15) 

n=l 

3.2 LDP for PD(^, a) 

We will prove the LDP for PD{6,a) by first establishing the LDP for Pi{9,a) and 
{Pii9,a),...,Pk{e,a)) for any A; > 2. 

Lemma 3.2 The family of the laws of Pi{9, a) satisfies a LDP on E with speed 9 and rate function 
hip) given in <\2.3\i . 



Proof: It follows from the GEM representation that 

E[e^^^'] < ^[e^^^i(^'°)] for A > 0; 
i?[e^^^^] > ii;[e"^^^(^'")] for A < 0. 

On the other hand, from the representation in Proposition 22 of [18J we obtain that 

S[e^^^^(^'-)]>i5;[e^^^^W]for A<0. 
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Since both the laws of Ui and Pi{0) satisfy LDPs with speed 6 and rate function /i(-), we 
conclude from Lemma 2.4 of [4J that the law of Pi(^,a) satisfies a LDP with speed 9 and rate 
function 

□ 

Lemma 3.3 For each fixed k > 2, let 

k 

Vfc = {p = {pi,:;Pk) : <pfc < ••• <Pi,^Pi < 1}, 

■t=i 

and Ve,k be law of {Pi{6,a), Pk{0,a)). Then the family {Ve^t : ^ > 0} satisfies a LDP on Vfc 
with speed 9 and rate function 

Ik{Pi,-,Pk) == < Z.,=iP» 

I oo, else. 

Proof: For /c > 2,t > 0, < pfc < • • • < Pi,Ei=iPi < 1, it follows from (|3l^ . UiSAM and (l3l^ 
that 



fiPi,.:,Pk) = . / t-[^+(^-^)"],,(tpfc,^)dt 

= ""'^^"^^ / (3.16) 



pKpi ■ • •Pfc-i)"^^ ^ ' Pk 

Ca,e(aC) Pfc / m+(k-l)a]i / Pk 



Set 



Then 



/•oo 







oo 



(t)i{u) = I s~^^+^^-^^"'Uh{su)^{su)ds 

{su)ds 



{aC)u~ 


ia+l) r^-^O+ka]^^^ 


Jo 




ia+l)-e+ka~l f"^ 


Jo 


{aC)u~ 


■ia+l)-e+ka-l 1 


Ca,6+ka 



9+ 



(3.17) 



For any n > 1, and u < it follows from (j3.12p that 



i^n+iit,u) (3.18) 
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= th{tu) / ipn{tu,ui)du\ 

Ju/u 

= th{tu) I [X{ux<l/n}{tu)h{tuUi) I tpn-l{tuUl,U2)dU2]dui 

Ju/u Ju\ju\ 

rl/(n-l) rl 

= t h{tu) / • • • / [{tuh{tuUi)) ■ ■ ■ {tuUi ■ ■ ■ Un-2h{tuUi ■ ■ ■ Un-2Un-l))] 

Ju/u Jun-l/Un-l 

x[X{«i<l/n} • • •X{«„-i<l/2}]'0l(*^*'"l • ■ ■ Un-2Un-l,Un)dUndUn-l ■ ■ ■ dui 
= t h{tu) I ■ ■ ■ [{tuh{tUUi)) ■ ■ ■ [tuUi ■ ■ ■ Un-2h{tUUi ■ ■ ■ ■Un-2Un-l))] 

Ju/u Jun-l/Un-l 

x[X{«i<l/n} • • • X{«„_i<l/2}] • • • Un-2Un-l)h{tuUi ■ ■ ■ Un-2Un-lUn)] 

Xtp{tuUi ■ ■ ■ Un-2Un-lUn)dUndUn-l ■ ■ ■ dui 

= (aC)''+4"("+^)° / / • • • / [X{«i<i/n} • • • X{u„.^<i/2}]dundun-i ■■■dui 

Ju/u Jui/ui J Un—l/Un — 1 

X{{UUI ■ ■ ■ li„)"("+^^'U""'"tt^^"'"^^" • • • U~^^'lp{tUUi ■ ■ ■ Un-2Un-lUn)}- 

Integrating over t we get 

(l>n+l{u) = (aC)"+^ / / ■■■ [ [X{ui<l/n}---X{un-i<l/2}]dUndUn-l---dui 

Ju/u Jui/ui Jun-l/u„-l 

X {(mil • • • Kn)"^°+'^n-""«^("-')" • • • U-^, 

X [ / • • • Un-2Un-lUn)dt]} (3.19) 

Jo 

= (aC)"+^-u-('*+^)u(^+*^"-^) ^ An{a,9){u) 



Ania,9)( 

^a,6+{n+k)a 



where 



Ania,9){u) = f f ■■■ j [X{ui<l/n}---X{ur,-i<l/2}]dUndUn-l---dui 

J uju J U\lu\ J Un—l/Un—l 

x{(ui • • • ^.^-("+1) • • • 

Let 

D = {{ui,...,Un) -.Ui e [u/u,l/n],...,Un-l £ [Un-2/Un-l,l/2],Un G [Un-l/Un, I]} ■ 
By definition, An{a,9){u) = for u = 

For < u < l/(n + l), the Lebesgue measure of D is strictly positive. It follows by direct 
calculation that 

1 " 
limo-,oo-^^ogAn{a,9){u) < esssup{^ log(l - Ur) : (ni,...,?x„) e D} < 0. (3.20) 

r=l 
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On the other hand, by Stirhng's formula, 



1 ^Q. 

log (^) = 0- (3-21) 



Thus for n > 1, and < u < l/(n + 1), 



Mm ^ Ania, 9) = 0. (3.22) 



Let 

k 

Vfc = {(pi,...,Pfc) G Vfc > 0,^pi < 1}. 

i=l 

Now for each fixed {pi, ...,pk) in V^, set 

m = max{j > 1 : p^/Pfc < 

and AQ{a,0){u) = 1 for any < u < 1. 

Then it follows from ([336|) . (|339|) and (13:22]) that 

- ("^)'(^^'=+i)'^'""'g(_i)n+i 'J^ A„_i(a,e)(^) (3.23) 



which implies that 



(Pl---Pfc)^+" Ca^0+(„+fc_l)a ' Pk' 



lim ^log/(pi,...,pfc) = - log- on (3.24) 



Introduce a metric dk on such that for any p, q in 

k 



4(p,q) = '^\Pi- Qi\- 

i=l 

For any 5 > 0, set 

ys{p) = {q G Vfc : 4(p, q) < S} 

Vs{p) = {q G Vfc : 4(p,q) < <5}. 

For every p G V^, one can choose 5 small enough such that Vs{p) C Vs{p) C V^. Let denote 
the Lebesgue measure on V^. Then by Jensen' inequality and p.24p . 

]im llogVeAVsip)} = hmilog^^^M/ /(q)Mciq) (3.25) 



e-^oo 9 ' 0^00 9 fi{Vs{p)) Jvsip) 

Ky^ip)) Jvs(p) 



> TTTT^ I 4(q)/u(dq). 
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Letting S approach zero and using the continuity of Ik{-) at p, one gets 

lim hm llogVe.k{ys{p)} > -4(p). (3.26) 

Since the family {Vg^k : ^ > 0} is exponentially tight, a partial LDP holds (|19j). Let J be any 
rate function associated with certain subsequence of {Vg^k : ^ > 0}. Then it follows from ()3.26p 
that for any p in 

J(p) < 4(p). (3.27) 

Because of the continuity of Ik and the lower semi-continuity of J, (j3.27p holds on Vfc. 
On the other hand for any p in V^, 

hm ilogPe,fc{y5(p)} = lim^log/ /(q)/i((iq) (3.28) 

- ziog/(q5) 

0^00 

= -4(q<5), 

where is in V^, such that 

/(q^) = sup{/(q) : q G ^^(p)}. 
The existence of such q^ is due to the continuity of / over V^. Letting 5 approach zero, one has 

lim hm - \ogVe,k{V5{v)} < -4(p). (3.29) 

Next consider the case that p is such that pk > 0, X]i=iP« = L Then pk/pk = 1- For small 
enough 5, we have 

Qk/qk > 1/2 for q G Vs{p). 
Thus An{a, 9){u) = for all n > 1 on Vs{p) and it follows from (j3.23p that 



hm llogVe,k{ys{p)} = lim i log/ /(q)M'^q) (3.30) 

< lim J log/ (g,+i)^+^"-i ^(dq) 



< log(l - as) 

where as is such that 



Letting S go to zero, one gets 



as = mi{^qi : q G Vsip)} < 1. 
1=1 



lim hm I logVe,k{Vs{p)} < -4(p). (3.31) 
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The only case remains is when there is a ^ < A; such that pi = 0. The upper bound in this case 
is obtained by focusing on a lower dimensional space of the positive coordinates. 
Thus we have shown that for every p in Vfc 

lim lim ^logPe,fc{F5(p)} = lim lim ^ log Pe,fe{t^5(p)} = -/fc(p), (3.32) 
which combined with the exponential tightness implies the result. 

□ 

Let 

oo 

V = {{PUP2, •••) : J^l > P2 > • • • > 0, < 1}. 

i=l 

and for notational simplicity we use Vg to denote the law of F{a,9) on V in the next theorem. 
Then we have 

Theorem 3.4 The family {Vg : 6 > 0} satisfies a LDP with speed 6 and rate function 

OO, else. 

Proof: Because V is compact, the family {Vg : > 0} is exponentially tight. It is thus sufficient 
to verify the local LDP (|19j). The topology on V can be generated by the following metric 



oo I 



1=1 

where p = (pi,p2, •••)) ^ = (^i) 92, •••)• For any fixed 6 > 0, let B{p,5) and B{p,6) denote the 

respective open and closed balls centered at p with radius 6 > 0. Set = 2 + [log2(l/5)] where 
[x] denotes the integer part of x. Set 

Vng{p;6/2) = {{qi,q2, •••) G V : |gi - pi\ < 6/2, i = 1, ...,ns}, 
V{{pi,...,pns);S/2) = {{qi,...,qns) G V„^ : \qi - Pi\ < V2,^ = l,...,^^}. 

Then we have 

Vn,{p;S/2)cB{p,S). 

By lemma 13.31 and the fact that 

Vg{Vn,ip; 6/2)} = Vg,ns{Vi{pi, ...,Pns); ^2)}, 

we get that 

liminfllogPe{5(p,<5)} > liminfilogPe,n,{V^((pi,...,PnJ;V2)} (3.34) 

O^oo u O—i-oo u 

> -Ins{Pl,-,Pns) > -I(.P)- 
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On the other hand for any fixed n > 1, 5i > 0, let 

Un{p;5i) = {{qi,q2,...) G V : |gi - Pi\ < di,i = l,...,n}, 
U{{pi,...,pn);6i) = {(gi,...,gn) G V„ : \qi - pi\ < 6i,i = l,...,n} 

Then we have 

Ve{Un{p; Si)} = Ve,n{U{{pi,...,Pny,di)}, 

and, for 5 small enough, 

B{p,5) c [/„(p;5i), 

which implies that 

limlimsup^log7'9{-B(p,(5)} < limsup ^ log •••,Pn), <5i)} (3.35) 

< -mi{Iniqi,...,qn) ■ {qi,...,qn) G U{{pi, ...,Pn),di)}. 

Letting 5i go to zero, and then n go to infinity, we get 

lim limsup JlogPe{S(p,5)} < -/(p), (3.36) 
which combined with ()3.34p implies the result. 

□ 

4 LDP for Two-Parameter Dirichlet Process 

Let Mi{E) denote the space of all probability measures on E equipped with the weak topology. For 
any diffusive u in Mi{E) with support E, let £,i,(,2,-- be independent and identically distributed 
with common distribution z/. Let '^0^a,u be the two-parameter Dirichlet process defined in (II. 2p . 

Let {(T{t) : t > 0, do = 0} be a subordinator with Levy measure x~^^^°'^e~^dx, x > 0, and 
{r(t) : t > 0, To = 0} be a gamma subordinator that is independent of {at : t > 0, cjo = 0} and has 
Levy measure x~^e~^dx, x > 0. 



Lemma 4.1 (Pitman and Yor) Let 



^{a,e) = —^ (4.37) 
1 1 — a 



For each n > 1, and each partition < ti < • • • < t„ = 1 of E, let Ai = (tj_i,tj] for i = 2, ...,n, 
Ai = [0,ti], and aj = jy{Aj). Set 

Ya,e{t) = a{j{a,e)t),t>0. 
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Then the distribution of {'Eg^a^u{Ai), ...,'Eg^a^u{An)) is the same as the distribution of 

Proof: Proposition 21 in [18] gives the subordinator representation for PD{6,a). The lemma 
fohows this representation and the construction outhned on page 254 in [T7]. 

□ 

Let 

^a,e[t) - — ^ — , 

n n—1 

By direct calculation, one has 

iPiX) = log(/.(A) =log^[e^'^W] = / (e^^' - l)x-("+i)e-^dx (4.38) 

Jo 

£(izi^)[l_(l_A)-], A<1 
oo, else. 



and 



L(A) = lim ^log^[e^^('')] (4.39) 

log(^), A<1 
oo, else. 

For any real numbers Ai, ...A„, let A = (Ai, A„). Then by direct calculation 

-log£;[exp{e(A,Z„,e(ti,...,tn))}] = -log£;[nr=i(i^,(e/a)[exp{A.a(l)}]^a^^('/"))] 

= llogii;[exp{(^-^^^V(A.))r(-)}] (4.40) 

1 " 

- A(Ai,...,A„) = -L(— ^5:K^,)V'(A.)). 
a r(l-a)^ 

For (yi, ...,yn) in i?", set 

n 

Jti,..,tAyi,-:yn) = sup {^Aiyi - A(Ai,...,A„)} (4.41) 

Ai,...,A„ 



I -in 

sup {J2Xiyi + -\og[J2i^{Ai){l-X,r]} 



Ai,...,A„G(-oo,l]" , " 
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Theorem 4.2 The family of the laws of Zia^g{ti, ...,tn) on space i?" satisfies a LDP with speed 9 
and rate function l \4-4^ - 

Proof: First note that both function and function L are essentially smooth. Let 

I^A = {(Ai, A„) : A(Ai, A„) < oo}, = interior of V^. 
It follows from (f439]l and that 

n 

Pa = {(Ai,...,A„) :Y^j,(A,)-^--i^iX,)) < 1}. 
1=1 ^ ' 

The fact that v has support E implies that ^{Ai) > for i = 1, ...,n, and 



Pa = {(Ai,...,A„) :^z.(^,)[l-(l-Air] < 1} 

1=1 

= {(Ai,...,A„):A,<l,i = l,...,n}\{(l, ...,!)}, 
= {(Ai,...,A„) : Aj < l,z = l,...,n}. 

Clearly the function A is differentiable on P^ and 

grad{A){\u..., A„) = — -L'{—^ ^ u{A,)^l^{Xi))iu{A^)^p' (Xi), i.(>l„)V''(A„)). 

r(l-a) r(l-a)^ 



A sequence Am approaches the boundary of P^ from inside implies that at least one coordinate 
sequence approaches one. Since the interior of {A : '(/'(A) < oo} is (— cxd, 1) and -0 is essentially 
smooth, it follows that A is steep and thus essentially smooth. The theorem then follows from 
Gartner-Ellis theorem ([5j). 

□ 

For (yi,...,yn) in and {xi,...,Xn) in E'^, define 



(0,...,0), (yi,...,y„) = (0,...,0) 



and 

^ii,..,t„(2;i,--,a;„) = inf{Jti,..,j„(yi, : F{yi,...,yn) = (xi, (4.42) 

Clearly /ti,...,t„(xi, ...,x„) = +oo if Y.k=i is not one. For (xi, ...,x„) satisfying Y.k=i ^k = 1, 
we have 

n 

Ih,..,tA^i,-,Xn) = inf{Jt^_„^j„(o2;i,...,aXn) : a = ^yfc > 0} (4.43) 

k=l 
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n 1 

= mf{ sup {a V AiXi + -log[Vz^(^i)(l - ^i)"} : a > 0} 

(Ai,...,A,Oe(-oo,l]" ,=1 « i=i 

= inf{ sup {o — log a 

(Ai,...,A„)G(-oo,l]" 
n -. n 

- 5] o(l - \i)xi + - log[^ i/(A,)[a(l - XiT]} : a > 0} 

n n 

= mf{a - log a : a > 0} + sup {- log[V i^(Ai)7f ] - V 7^X1} 

(7i,...,7n)ei?!f a i=i i=i 

sup {- iog[^ i^(A,)7r] + 1 - E 

Theorem 4.3 The family of the laws of ■■■,'^e,a,uiAn)) on space E"^ satisfies a LDP 

with speed 9 and rate function 

sup(^,„..,^„)efin {1 log[E?=i <Ai)^f\ 
It,,..,tSxi,:.,Xn) = { ELi^fc = l (4-44) 

00, else 

Proof: Since Jt^^...^t^(0, 0) = 00, the function F is thus continuous on the effective domain 
of Jti,..,t„- The theorem then follows from Lemma l4.1l and the contraction principle (remark (c) of 
Theorem 4.2.1 in [S]). 

□ 

Remark. Let A„ = {{xi, ...,x„) G E'^ : J2?=i = !}• Then the result in Theorem 14.31 holds with 
being replaced by A„. 

Let Bf){E) and Ch{E) denote the sets of bounded measurable functions, and bounded continuous 
functions on E respectively. For each fj, in Mi{E), set 





A 

= sup |- 




/>o,/eC6{£;) a 




= sup { / 




f>OJeBt{E) J 




= sup { / g{x 








= sup { / g{x 







{x)n{dx)}, (4.45) 
sup { / log f{x)i^{dx) + 1 - / fix)is{dx)}, (4.46) 

(4.47) 



where H{v\ij) is the relative entropy of v with respect to /i. 
Lemma 4.4 For any fi in Mi{E), 

r (^) = sup{/t,,..,t J^(^i), /i(A)) : < ti < t2 < • • • < tn = 1; n = 1, 2, ...}. (4.48) 
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Proof: It follows from Tietze's continuous extension theorem and Luzin's Theorem that we can 
replace Cf){E) with Bi,{E) in the definition of This implies that 

r{^i) > sup{/i,,..,j„(^(^i),...,/i(^„)) : < ti < t2 < ••• < tn = l;n = 1,2,...}. 

On the other hand, for each nonnegative / in Cb{E), let 

ti = -,7i = fiti),i = 1, ■■■,n. 
n 



Then 



Uog{({f{x)rv{dx))- f fixMdx) = \im{^log[j2HA^hn-^2^^K^^^^ 



1=1 1=1 

< sup{/ti,..,t„(/u(Ai),...,/x(A„)) : < ti < t2 < • • • < tn = l;n = 1,2,...}, 
which implies 

< sup{/t,,..,t„(/i(yli),...,//(A„)) : < ti < t2 < ••• <tn = l;n = 1,2,...}. 

□ 

Remarks. It follows from the proof of Lemma 14.41 that the supremum in (j4.48p can be taken over 
all partitions with being the continuity points of /i. By monotonically approximating 

nonnegative f{x) with strictly positive functions from above, it follows from the monotone con- 
vergence theorem that the supremum in both (j4.45p and (j4.46p can be taken over strictly positive 
bounded functions, i.e., 

/"(m) = sup {-log(/(/(x))V(dx)) + l- //(xHdx)}, (4.49) 

f>oj&Ct{E) a J J 

/O(^) = sup {[ \ogf{x)v{dx) + l- [ f{x)^i{dx)}. (4.50) 



f>OJ&Bi{E) 

Lemma 4.5 

= H{u\t,) (4.51) 

Proof: If I' is not absolutely continuous with respect to fi, then H(i'\n) = +oo. Let A be a set 
such that n{A) = 0, 1'iA) > and define 



Then 



m, X & A 
1, else 



I^il^') ^ i^(A)logm ^ cx) as m — > oo. 
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Next we assume u ^ fi and denote ^(a^) by ■il){x). By definition, 



Choosing fuix) = iIj{x) A M in the definition of . Since the function j;log a; is bounded below 
for non-negative x, applying the monotone convergence theorem on {x : 'ijj{x) > e~^}, one gets 

I'>{fi) > hm / f m{x) log fM{x)fi{dx) - log / fM{x)f^{dx) = H{iy\i^). (4.52) 

M -^oo J J 

On the other hand, it follows by letting f{x) = e^^^^ in the definition of H{u\fi) that 

H{i^\fi) = sup { / logf{x)u{dx) - log / f{x)fi{dx)}. 
f>o,feB,(E) J J 

Since 

J f{x)fJ-{dx) - l>\og j f{x)fi{dx), 

we get that 

H{l.\^i) > I^ifi) (4.53) 
which combined with (j4.52p implies (|4.51|) . 

□ 

Lemma 4.6 For any // in Mi{E), < ai < 02 < 1; 

r^/") > ^"H/^), (4.54) 
and for any a in (0, 1), one can find ^ in Mi{E) satisfying u <^ fi such that 

r(/i)>/0(^), (4.55) 
Proof: By Holder's inequality, for any < ai < Q!2 < 1, 

— log( / {f{x)r^uidx)) < — log( / (/(x))"V(dx)), (4.56) 



ai J Q!2 
and the inequality becomes strict if f{x) is not constant almost surely under z^. Hence I°'{fJ') is 
non-decreasing in a over (0, 1). It follows from the concavity of logx that 

Uog{ju{x)r<dx)) > J logfixMdx), 

which implies that I^'ifJ-) > I^ip-) for a > 0. 

Next choose in Mi{E) such that v ^ ^ and ^(x) is not a constant with v probability one, 
then /°(/x) > /"/2(^) > for a > 0. 

□ 

We now ready to prove the main result of this section. 
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Theorem 4.7 The family of the laws of 'BQ^a,v on space Mi{E) satisfies a LDP with speed 6 and 
rate function 

Proof: Let {fj{x) : j = 1, 2, ...} be a countable dense subset of Cb{E) in the supremum norm. 
The set {fj{x) : j = 1, 2, ...} is clearly convergence determining on Mi{E). Let \ fj\ = sup^.^^; \ fj{x)\ 
and 

Then C is also convergence determining. 
For any a;, /x in Mi{E), define 



Pi^^,fj') = Y,^\{^^,9j) - {f^,9j)\- (4.57) 

i=i 

Then p is a metric on Mi (E) and generates the weak topology. 
For any (5 > 0, /X G Mi{E), let 

B{fx,d) = {uje Mi{E) : p(u;,iJ,) < S}, B{n,6) = {lo e Mi{E) : p(a;,/x) < S}. 

Since Mi{E) is compact, the family of the laws of Eg^a is exponentially tight. It thus suffices 
to show that 

limliminf ^logP{S(/x,(5)} = lim limsup ^ log P{S(//, J)} = (4.58) 

5— >0 8—*oo u 5— »0 >oo " 

Choose m large enough, one gets 

{uj^Mi{E) : \{u,gj) - {n,gj)\ < 5/2 : j = I, - ■ ■ ,m} ^ B{v,6). (4.59) 
Choose a partition ti , • • • , t„ such that 

s\r£>{\gj{x) - gj{y)\ : x,y G Ai,i = l,---,n;j = 1,- • • ,m} < 6/8. 
Choosing < 5i < and define 

Vti,-,tn{l^,Si) = {{yi,-,yn) e A„ : \yi - /x(A)| < Si,i = 1,- ■ ■ ,n}. 
For any lu in Mi(E), let 

= (a;(Ai),...,w(A„)). 
If F(a;) G T4i,...,t„(/x, (5i), then for j = 1, ...,m 



I (a; 



:9j) - {^^,9j)\ = \J2 gj{x){uj{dx) - n{dx)) 

i=l ■'■^i 



< -+nSi< S/2, 
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which imphes that 



{uj G Mi{E) : FH G Vt,,.,tAf^, 6i)} C {to e Mi{E) : 5,) - {ii,gj)\ < 5/2 : j = 1, ■ • -,171}. 

This combined with (j4.59p imphes that 

{a; G Mi(F) : F{lu) G Vt,,..,tAl^,6i) C 5(^,(5). (4.60) 

Since V4^^...^f„(//, 5i) is open in A„, it fohows from Theorem 14.31 that 

hmhminf ^logP{5(/i,(5)} (4.61) 
(5— >o ff— >oo a 

> limliminf ^logP{cj G Mi(F) : F{u;) G yt,,..,t„(^, 5i)} 

= limhminf ^logP{(H6),Q,,i/(vli),...,H0,Q,,,,(A„)) G Vji,...,t„(;U, 5i)} 

>-/i„..,t„(/i(Ai),...,M^O)>-^"(M)- 

Next we will focus on partitions ti, ...,tn such that ti, are continuity points of fx. We 

denote the collection of all such partitions by P^. This implies that F{uj) is continuous at /i. Hence 
for any 62 > 0, one can choose 6 > small enough such that 

B{f,,6)cF-'{Vt,,..,t,if^,52)}. 

Let 

^ii,-A(^''^2) = {{yi, •••,yn) e A„ : |yi - n{Ai)\ < 62,1 = l, - ■ ■ ,n - l}. 
Then we have 



limlimsup^logP{S(;U,<5)} (4.62) 

1 _ 

< limSUp-logP{(H6),Q,,i/(^l),---,Se,a,i/(^n)) G l^ti,-,t„(/i,'^2)}. 

9^00 ^ 

By letting 82 go to zero and applying Theorem 14.31 again, one gets 

lim limsup^logP{;B(/i,5)} < (/x(Ai), /x(A„)). (4.63) 

Finally, taking supremum over and taking into account the remark after Lemma 14.41 one 
gets 

lim limsup \ log P{B{fi, 6)} < -I^ifi), (4.64) 
which, combined with (j4.6ip . implies the theorem. 

□ 
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5 Further Comments 



Our results show that the LDPs for GEM{6,a) and PD{9,a) have the same rate function. Since 
GEM{6, a) and PD{6, a) differs only by the ordering, one would expect to derive the LDP for one 
from the LDP for the other. Unfortunately the ordering operation is not continuous and it is not 
easy to establish an exponential approximation. The LDPs for GEM{0, a) and PD{9, a) also have 
the same rate function as the LDPs for GEM (6) and PD{0). Thus a does not play a role in these 
LDPs. This is mainly due to the topology used. It will be interesting to investigate the possibility 
of seeing the role of a through establishing the corresponding LDPs on a stronger topology. 

The LDPs for 'Bq^oi,v and S^i^i, have respective rate functions and Both and 

'Ee^u converge to u for large 0. When 6 becomes large, each Pi{9, a) is more likely to be small. The 
introduction of positive a plays a similar role. Thus the mass in 'Eq^oi,v spreads more evenly than 
the mass in 'Bq^i,. Intuitively S^^q:,!/ is "closer" to u than S^^j,. This observation is made rigorous 
through the fact that /"(•) can be strictly bigger than /°(-)- The monotonicity of /"(•) in a shows 
that a can be used to measure the relative "closeness" to u among all 'r.o^a,^ for large 0. 

The process Ya.et*) is a process with exchangeable increments. One could try to establish a 
general LDP result for processes with exchangeable increments and derive the result in Section 4 
through contraction principle. The proofs here illustrate most of the procedures needed for pursuing 
such a general result from which the LDP for r.Q^a,v follows. 
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